Skip to main content

More right-solid
More right-solid
More right-solid
More right-solid
SHOW DETAILS
up-solid down-solid
eye
Title
Date Archived
Creator
Community Audio
Jan 28, 2020
audio
eye 62
favorite 0
comment 0
A night of Neil Young cover songs held at Swarthmore College some time in 1998. Better to burn out, than to fade away.
Topics: neil young, rock, drunk rock
WEWA domain crawls
Nov 14, 2019
image
eye 5
favorite 0
comment 0
wewaglobe
Topic: wewa
fatcat
collection
0
ITEMS
23
VIEWS
Jun 7, 2019
collection
eye 23
fatcat!
Topics: fatcat, open access
WASAPI Project - Web Archiving Systems APIs
Apr 17, 2019
texts
eye 25
favorite 0
comment 0
Final performance report for the WASAPI project, IMLS grant LG-71-15-0174-15, "Systems Interoperability and Collaborative Development for Web Archiving.
Topic: web archiving
WASAPI Project - Web Archiving Systems APIs
Dec 3, 2018 Jefferson Bailey
texts
eye 22
favorite 0
comment 0
Year 2 performance report for the WASAPI project, IMLS grant LG-71-15-0174-15, "Systems Interoperability and Collaborative Development for Web Archiving.
Topics: web archiving, digital preservation, APIs
Web Archive Domain Files
data
eye 75
favorite 0
comment 0
dotnl-2016-present-domains-in-wayback-domain+year-of-last-capture
Topics: domains, web archive, lists
Web Archive Domain Files
collection
1
ITEMS
856
VIEWS
Apr 18, 2018 IA Web Archiving Group
collection
eye 856
Random set of domain lists extracted from the IA web archive, usually ccTLD lists, often at the request of partners, especially National Libraries. File name will (hopefully) describe file contents. Posted "as is" so you get what you get.
Topics: web archive, domains, zone file
The Early .gov Web Domain
collection
6
ITEMS
214
VIEWS
Dec 15, 2017
collection
eye 214
A special research collection of the entire .gov web domain from 1996-2001 as represented within IA's web archive.
Topics: government, web, research
Web Data Services
collection
684
ITEMS
365,120
VIEWS
Dec 15, 2017
collection
eye 365,120
Datasets, special collections, and other derived and extracted subsets of web data culled from IA's web archive. Many of these datasets were created in relation to specific partnerships and collaborative projects supporting computational research and data mining using web archives.
Topics: research, dataset, web
web_locrl
collection
1,562
ITEMS
101,528
VIEWS
Oct 20, 2017
collection
eye 101,528
Files related to general web crawls.
Topic: web
Corporation Websites Collection
collection
639
ITEMS
584,310
VIEWS
Sep 21, 2017
collection
eye 584,310
This collection contains an extracted web archive corpus of 0.8+ million corporate websites (from an original list of ~0.98 websites) extracted from the archive.org web archive, covering the period 1996 to early 2017. This corpus was originally created as a collaboration between the Internet Archive and a group at Dartmouth University, but it may be useful to other researchers. Updated or more detailed information may exist at:...
Topics: websites, corporations, homepages
End Of Term 2016 UNT Crawls
collection
1,275
ITEMS
7.6M
VIEWS
Aug 28, 2017
collection
eye 7.6M
End of Term 2016 Web Archive government web crawls by project partner the University of North Texas.
Topics: end of term, federal government, 2016, president, congress, university of north texas
Web Archive Reading Rooms
Jun 22, 2017 Jefferson Bailey
image
eye 59
favorite 0
comment 0
Viewing the first capture of https://www.kb.nl/ homepage as found in the onsite-only-access web archive of the Koninklijke Bibliotheek (the National Library of the Netherlands) in the Hague, Netherlands in June 2017. (Technically via the in-building network from the desk of Kees Teszelszky. Thank you Kees!)
Topics: web archiving, open access, digital preservation
Web Archive Reading Rooms
Jun 22, 2017 Jefferson Bailey
image
eye 45
favorite 0
comment 0
Viewing a 1990s-era archived archive.org webpage as found in the onsite-only-access web archive of the Bibliothèque nationale de France François-Mitterrand Library in Paris, France in June 2017.
Topics: web archiving, open access, digital preservation
Web Archive Reading Rooms
collection
2
ITEMS
128
VIEWS
Jun 22, 2017 Jefferson Bailey
collection
eye 128
Promoting online access to web archives by posting, online, pictures of me looking at reading-room-access-only web archives. 
Topics: web archiving, open access, digital preservation
Jun 10, 2017 Jefferson Bailey
collection
eye 336
Materials from the National Symposium on Web Archiving Interoperability, part of the WASAPI project, held February 21-22, 2017 at Internet Archive, San Francisco, CA. Additional, web-based presentations: https://wayback.archive-it.org/9135/20170712185836/http://labs.rhizome.org/presentations/wasapi-symposium-2017.html#/ https://wayback.archive-it.org/9135/20170712185836/http://www.gregwiedeman.com/presentations/slides/wasapi.html#/
Topics: web archiving, wasapi, digital preservation
WASAPI Project - Web Archiving Systems APIs
Jun 8, 2017 Stanford University Libraries
movies
eye 47
favorite 0
comment 0
Video demonstration of work on utilities built by Stanford University Libraries using the WASAPI data transfer API.
Topics: web archiving, digital preservation, APIs
WASAPI Project - Web Archiving Systems APIs
May 27, 2017 Stanford University Libraries
movies
eye 48
favorite 0
comment 0
Video demonstration of work on utilities built by Stanford University Libraries using the WASAPI data transfer API.
Topics: web archiving, APIs, digital preservation
WASAPI Project - Web Archiving Systems APIs
May 27, 2017 Stanford University Libraries
movies
eye 54
favorite 0
comment 0
Video demonstration of work on utilities built by Stanford University Libraries using the WASAPI data transfer API.
Topics: web archiving, APIs, digital preservation
WASAPI Project - Web Archiving Systems APIs
May 26, 2017 Jefferson Bailey
texts
eye 127
favorite 0
comment 0
Year 1 performance report for the WASAPI project, IMLS grant LG-71-15-0174-15, "Systems Interoperability and Collaborative Development for Web Archiving."
Topics: web archiving, digital preservation, APIs
WASAPI Project - Web Archiving Systems APIs
collection
14
ITEMS
772
VIEWS
May 26, 2017
collection
eye 772
Collection of materials related to the WASAPI project, an IMLS-funded initiative on " Systems Interoperability and Collaborative Development for Web Archiving ." Code and other materials can also be found at the project's Github page . 
Topics: web archiving, digital preservation, APIs
End Of Term 2016 Library of Congress Crawls
collection
3,892
ITEMS
3.9M
VIEWS
May 14, 2017
collection
eye 3.9M
End of Term 2016 Web Archive government web crawls by project partner the Library of Congress.
Topics: end of term, federal government, 2016, president, congress, library of congress, web, data, library...
SanFranciscoBayGuardianCrawl
collection
1
ITEMS
7,752
VIEWS
Apr 12, 2017
collection
eye 7,752
Crawl of SFBG done at the request of the paper in 2014.
Topics: news, san francisco
Government Web & Data Archive
collection
6,269
ITEMS
8.8M
VIEWS
Apr 6, 2017
collection
eye 8.8M
This collaborative project is an extension of the 2016  End of Term  project, intended to document the federal government's web presence by archiving government websites and data. As part of this preservation effort, URLs supplied from partner institutions, as well as nominated by the public, will be crawled regularly to provide an on-going view of federal agencies' web and social media presence. Key partners on this effort are the Environmental Data & Governance...
Topics: government, data, federal, congress
Apr 2, 2017
collection
eye 5,073
The Environmental Data & Governance Initiative (EDGI)  is an international network of academics and non-profits addressing potential threats to federal environmental and energy policy, and to the scientific research infrastructure built to investigate, inform, and enforce. More information is available at  https://envirodatagov.org/
Topics: data, environment, climate, government
NYPL Labs Archive
collection
0
ITEMS
2,508
VIEWS
Apr 2, 2017
collection
eye 2,508
Archive of NYPL Labs.  https://web.archive.org/web/*/https://www.nypl.org/collections/labs
Topics: libraries, labs, digital, innovation
web_domain_tests
collection
8,159
ITEMS
29.7M
VIEWS
Mar 28, 2017
collection
eye 29.7M
WARCs from internal crawl testing.
Topics: web, cctld
Obama White House Social Media Archive
collection
8
ITEMS
836
VIEWS
Jan 3, 2017 The Obama White House
collection
eye 836
Collection of the official social media of the Obama White House, made available for public access in January 2017 prior to the end of the Obama administration by the White House Office of Digital Strategy. The collection includes data from Twitter, Tumblr, Vine, Facebook, and YouTube (pending). For more information on the public release of Obama's social media data, see these White House posts: ...
Topics: obama, white house, government data, social media
Community Video
Dec 29, 2016 Vinay Goel
movies
eye 73
favorite 0
comment 0
A timelapse of the whitehouse.gov homepage from Jan 20, 2009 to Dec 19, 2016 using the Internet Archive Wayback Machine and web archive. Created for the White House Social Media Data Hackathon at Internet Archive on Jan 7, 2017.
Topics: whitehouse, obama, timelapse
End of Term 2016 Post-Inauguration Crawls
collection
5,308
ITEMS
3.6M
VIEWS
Dec 15, 2016
collection
eye 3.6M
This collection contains web crawls performed as the post-inauguration crawl for part of the End of Term Web Archive, a collaborative project that aims to preserve the U.S. federal government web presence at each change of administration. Content includes publicly-accessible government websites hosted on .gov, .mil, and relevant non-.gov domains, as well as government social media materials. The web archiving was performed in the Winter of 2016  and Spring of 2017 to capture websites...
Topics: end of term, federal government, 2016, president, congress
End Of Term 2016 Pre-Inauguration Crawls
collection
4,693
ITEMS
9.5M
VIEWS
Dec 15, 2016
collection
eye 9.5M
This collection contains web crawls performed as the pre-inauguration crawl for part of the End of Term Web Archive, a collaborative project that aims to preserve the U.S. federal government web presence at each change of administration. Content includes publicly-accessible government websites hosted on .gov, .mil, and relevant non-.gov domains, as well as government social media materials. The web archiving was performed in the Fall and Winter of 2016 to capture websites prior to the January...
Topics: end of term, federal government, 2016, president, congress
End of Term 2016 Web Crawls
collection
19,084
ITEMS
25.4M
VIEWS
Nov 21, 2016
collection
eye 25.4M
This collection contains web crawls performed as part of the End of Term Web Archive, a collaborative project that aims to preserve the U.S. federal government web presence at each change of administration. Content includes publicly-accessible government websites hosted on .gov, .mil, and relevant non-.gov domains, as well as government social media materials. The web archiving was performed in the Fall and Winter of 2016 and Spring of 2017. For more information, see...
Topics: end of term, federal government, 2016, president, congress, government data
Community Video
Oct 25, 2016 Trevor Thornton
movies
eye 1,097
favorite 1
comment 1
Data visualization of animated GIFs from the Geocities web archive. The visualization was done by staff at the Hunt Library at North Carolina State University and displayed in their data visualization lab. The project was done in association with the Internet Archive's 20th Anniversary celebrating the history of archiving the web.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topics: geocities, animated gifs, data visualization
Military Industrial Powerpoint Complex
collection
1,272
ITEMS
767,435
VIEWS
Oct 21, 2016 United States Military
collection
eye 767,435
This collection was a special project originally done as part of the Internet Archive's 20th Anniversary celebration on October 26, 2016 highlighting IA's web archive. The collection consists of all the Powerpoint files (57,489) from the .mil web domain that were crawled from the public web (with no special login or credentials) by the Internet Archive and partners from 1996-2017. The original release in October 2016 featured 48,110 Powerpoint files. Another 9,379 unique new Powerpoint files,...
Topics: military, industry, powerpoint, complex
SM feed test
Jul 15, 2016
web
eye 26
favorite 0
comment 0
Community Video
Jun 15, 2016
movies
eye 92
favorite 0
comment 0
Web Archive sculpture donated from Internet Archive to Library of Congress in 1998. Video from 2016, noting it's semi-functional status and location in staff offices in the Adams building. 
Topics: wayback, web archives, art, sculpture
Flickr Commons Archive
May 11, 2016
image
eye 41
favorite 0
comment 0
cuba flag
Topic: cuba
Estonian Web Domain
collection
0
ITEMS
23
VIEWS
Jan 31, 2016 Jefferson Bailey
collection
eye 23
Overview of the historical Estonian web domain (.ee).
Topics: web, estonia
Cuban Web Domain Crawl
collection
385
ITEMS
2.1M
VIEWS
Dec 29, 2015
collection
eye 2.1M
This collection is a snapshot of the Cuban Web Domain (.cu) from early 2016.
IMLS Museum Universe Data File Crawl
collection
2,885
ITEMS
22.5M
VIEWS
Dec 19, 2015
collection
eye 22.5M
2015 crawl of museum websites listed in the IMLS Museum Universe Data File. More about the IMLS MUDF can be found at https://www.imls.gov/research-evaluation/data-collection/museum-universe-data-file
Topic: AIT
Internet Archive Presents
Nov 3, 2015 Jefferson Bailey, Maria LaCalle
texts
eye 137
favorite 0
comment 0
Presentation by Jefferson Bailey and Maria LaCalle at the Preservation Metadata Interest Group at the ALA 2015 conference in San Francisco (Frisco) California.
Topics: web archiving, metadata, digital preservation, archives, libraries
Community Texts
Jun 9, 2015
texts
eye 25
favorite 0
comment 0
Lat-Lon CSV for ARS workshops
Topic: data
Community Video
Feb 13, 2015 Jefferson Bailey, Herbert Van de Sompel
movies
eye 55
favorite 0
comment 0
Slides and audio from Jefferson Bailey, Program Manager, Internet Archive and  Herbert Van de Sompel, Digital Library Research & Prototyping, Los Alamos National Laboratory as part of the "Strategies I" panel from the Georgetown Law Library symposium "404/File Not Found: Link Rot, Legal Citation and Projects to Preserve Precedent" on October 24, 2014 in Washington, D.C. Uploaded by me, Jefferson, so we could embed it in a blog post -- not as some weird vanity thing! I...
Topics: web archiving, digital preservation, archives, link rot, libraries
Community Video
Dec 17, 2014 jefferson_bailey
movies
eye 383
favorite 0
comment 0
A mash-up supercut of all the ducking and covering in the Federal Civil Defense film "Duck and Cover" ( https://archive.org/details/gov.ntis.ava11109vnb1 ) scored with "Silent Night" by @nullsleep of 8bitpeoples from "The 8bits of Christmas" ( https://archive.org/details/8bp038 ). Made for holiday season 2014 -- donate to Internet Archive! https://archive.org/donate/
Topics: ducking, covering, chiptune, holiday, nuclear apocalypse
Ferguson Tweets
Nov 26, 2014 Ed Summers
data
eye 41
favorite 0
comment 0
417,972 URLs and corresponding tweet IDs that tweeted that URL, all in a fat old .tsv extracted from Ed Summers' collection of 13,480,000 tweet IDs that mentioned 'ferguson' from 2014-08-10 22:44:43 to 2014-08-27 15:15:50 that is here: https://archive.org/details/ferguson-tweet-ids. Thanks to Ed Summers for doing this!
Topics: twitter, ferguson
Ferguson Tweets
Nov 25, 2014 Ed Summers
data
eye 69
favorite 0
comment 0
417,972 URLs extracted from Ed Summers' collection of 13,480,000 tweet IDs that mentioned 'ferguson' from 2014-08-10 22:44:43 to 2014-08-27 15:15:50 listed here: https://archive.org/details/ferguson-tweet-ids
Topics: twitter, ferguson
More Podcast, Less Process
collection
10
ITEMS
1,226
VIEWS
Sep 22, 2014
collection
eye 1,226
More Podcast, Less Process
Topic: listmania
More Podcast, Less Process
Jun 9, 2014 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 141
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More  information: ...
Topics: archives, libraries, digitization, preservation, performing arts
More Podcast, Less Process
May 6, 2014 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 78
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More  information:...
Topics: archives, preservation, podcast, history, electronic records, formats, records management
More Podcast, Less Process
Mar 31, 2014 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 110
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More  information: ...
Topics: archives, video, media, film, preservation, conservation, libraries
More Podcast, Less Process
Feb 24, 2014 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 251
favorite 1
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More  information:...
Topics: archives, libraries, web archiving, digital preservation, columbia, new york art resources...
More Podcast, Less Process
Jan 30, 2014 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 86
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More  information: ...
Topics: archives, libraries, special collections, research, genealogy
More Podcast, Less Process
Dec 23, 2013 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 80
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. More information:...
Topics: archives, libraries, special collections, film preservation, research
More Podcast, Less Process
Dec 9, 2013 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 62
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. Episode 004 features archivist,...
Topics: archives, libraries, special collections, preservation, conservation
More Podcast, Less Process
Nov 11, 2013 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 98
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. Episode 003 features  Janet Bunde,...
Topics: archives, libraries, special collections, education, research
More Podcast, Less Process
Oct 22, 2013 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 119
favorite 0
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. Episode 002 features Grace Lile and...
Topics: archives, libraries, special collections, preservation, conservation, activism, advocacy
More Podcast, Less Process
Oct 7, 2013 Metropolitan New York Library Council and AudioVisual Preservation Solutions
audio
eye 201
favorite 2
comment 0
More Podcast, Less Process is a podcast featuring interviews with archivists, librarians, preservationists, technologists, and information professionals about interesting work and projects within and involving archives, special collections, and cultural heritage. Topics include appraisal and acquisition, arrangement and description, reference, outreach and education, collection management, physical and digital preservation, and infrastructure and technology. Episode 001 features Mark Matienzo,...
Topics: archives, libraries, special collections, museums, preservation, conservation, digital...
Community Texts
Aug 11, 2013 Joseph Cuvelier, L. Stainer
texts
eye 1,795
favorite 1
comment 0
Proceedings of the International Congress of Archivists and Librarians held in Brussels in 1910. Multiple languages, mostly English, French, and Dutch. Table of contents is at end page 479/484 of pdf (pg 804 in the original work).
Topics: archives, libraries